Basic Statistics

Raw Counts

Name Value
Rows 7,422,037
Columns 38
Discrete columns 19
Continuous columns 19
All missing columns 0
Missing observations 1,499,052
Complete Rows 7,263,435
Total observations 282,037,406
Memory allocation 2.1 Gb

Percentages

Data Structure

Missing Data Profile

Univariate Distribution

Histogram

Bar Chart (with frequency)

## 11 columns ignored with more than 50 categories.
## FL_DATE: 365 categories
## ROUTE: 6536 categories
## ROUTE_NAME: 6536 categories
## ORIGIN: 360 categories
## ORIGIN_NAME: 358 categories
## ORIGIN_CITY: 343 categories
## DEST: 360 categories
## DEST_NAME: 358 categories
## DEST_CITY: 343 categories
## DEP_TIME: 1441 categories
## ARR_TIME: 1441 categories

QQ Plot

## Warning: Removed 83 rows containing non-finite values (stat_qq).
## Warning: Removed 83 rows containing non-finite values (stat_qq_line).

## Warning: Removed 88 rows containing non-finite values (stat_qq).
## Warning: Removed 88 rows containing non-finite values (stat_qq_line).

Correlation Analysis

## 11 features with more than 20 categories ignored!
## FL_DATE: 365 categories
## ROUTE: 6495 categories
## ROUTE_NAME: 6495 categories
## ORIGIN: 357 categories
## ORIGIN_NAME: 357 categories
## ORIGIN_CITY: 342 categories
## DEST: 357 categories
## DEST_NAME: 357 categories
## DEST_CITY: 342 categories
## DEP_TIME: 1440 categories
## ARR_TIME: 1440 categories

Principal Component Analysis

## 11 features with more than 50 categories ignored!
## FL_DATE: 365 categories
## ROUTE: 6495 categories
## ROUTE_NAME: 6495 categories
## ORIGIN: 357 categories
## ORIGIN_NAME: 357 categories
## ORIGIN_CITY: 342 categories
## DEST: 357 categories
## DEST_NAME: 357 categories
## DEST_CITY: 342 categories
## DEP_TIME: 1440 categories
## ARR_TIME: 1440 categories